Automated real time interpreter service

ABSTRACT

Aspects of the present disclosure relate to systems and methods that aid users with hearing and/or speech impediments to have a conversation with a remote phone number without human assistance. In one aspect, an application residing on a user&#39;s device, such as a smartphone, tablet computer, laptop, etc., may be used to initiate a phone call to a recipient. Upon initiating the phone call locally, a service residing on the server may receive a request to initiate a connection to the recipient. Once the recipient answers, the user may converse with the recipient by providing text input to their local app. The text input may be transmitted to the service. The service may use a text to speech converter to translate the received text to speech that can be delivered to the recipient.

This application is a continuation of U.S. patent application Ser. No.15/730,176, filed on Oct. 11, 2017, now issued as U.S. Pat. No.10,854,110, which application claims priority to U.S. Provisional PatentApplication No. 62/466,859, entitled “Automated Real Time InterpreterService,” filed on Mar. 3, 2017, the disclosures of which are herebyincorporated by reference in their entireties.

BACKGROUND

Users with hearing and/or speech impairments cannot make phone callsunassisted. In order to make a phone call, a user relies upon aninterpreter which can be costly and requires scheduling time with aninterpreter in advance.

It is with respect to these and other general considerations that theaspects disclosed herein have been made. Also, although relativelyspecific problems may be discussed, it should be understood that theexamples should not be limited to solving the specific problemsidentified in the background or elsewhere in this disclosure.

SUMMARY

Aspects of the present disclosure relate to systems and methods that aidusers with hearing and/or speech impediments to have a conversation witha remote phone number without human assistance. In one aspect, anapplication residing on a user's device, such as a smartphone, tabletcomputer, laptop, etc., may be used to initiate a phone call to arecipient. Upon initiating the phone call locally, a service residing onthe server may receive a request to initiate a connection to therecipient. Once the recipient answers, the user may converse with therecipient by providing text input to their local app. The text input maybe transmitted to the service. The service may use a text to speechconverter to translate the received text to speech that can be deliveredto the recipient. Spoken responses from the recipient may also bereceived by the service and translated into text. The translated textmay be provided to the application on the user's local device fordisplay. The service may also provide additional features such as genderrecognition, voice selection, amplitude association for transcriptionvisualization, and emotion recognition for transcription visualization,among other services.

This Summary is provided to introduce a selection of concepts in asimplified form that are further described below in the DetailedDescription. This Summary is not intended to identify key features oressential features of the claimed subject matter, nor is it intended tobe used to limit the scope of the claimed subject matter. Additionalaspects, features, and/or advantages of examples will be set forth inpart in the description which follows and, in part, will be apparentfrom the description, or may be learned by practice of the disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

Non-limiting and non-exhaustive examples are described with reference tothe following figures.

FIG. 1 illustrates an overview of an exemplary system for providing anautomated interpreter service.

FIG. 2 illustrates an exemplary method 200 of providing an automatedinterpreter service.

FIG. 3 is a block diagram illustrating example physical components of acomputing device with which aspects of the disclosure may be practiced.

FIGS. 4A and 4B are simplified block diagrams of a mobile computingdevice with which aspects of the present disclosure may be practiced.

FIG. 5 is a simplified block diagram of a distributed computing systemin which aspects of the present disclosure may be practiced.

FIG. 6 illustrates a tablet computing device for executing one or moreaspects of the present disclosure.

DETAILED DESCRIPTION

Various aspects of the disclosure are described more fully below withreference to the accompanying drawings, which form a part hereof, andwhich show specific example aspects. However, different aspects of thedisclosure may be implemented in many different forms and should not beconstrued as limited to the aspects set forth herein; rather, theseaspects are provided so that this disclosure will be thorough andcomplete, and will fully convey the scope of the aspects to thoseskilled in the art. Aspects may be practiced as methods, systems ordevices. Accordingly, aspects may take the form of a hardwareimplementation, an entirely software implementation or an implementationcombining software and hardware aspects. The following detaileddescription is, therefore, not to be taken in a limiting sense.

Aspects of the present disclosure relate to systems and methods that aidusers with hearing and/or speech impediments to have a conversation witha remote phone number without human assistance. Historically atelecommunication relay service was used to aid in phone calls. Atelecommunication relay service involves having a third partyinterpreter sitting in the middle of the call to help communicate withboth the parties. The human interpreter receives the text message fromthe caller, speaks that to the call receiver, hears their reply andtypes it back to the caller. This often involves having dedicatedhardware to type and receive text output. Some hardware for Video relayservice enables user to communicate through sign language to theinterpreter.

Telecommunication relay services have a number of significant drawbacks.For example, the reliance on an interpreter is costly and often requiresa user to schedule time with the interpreter in advanced. Use of thisservices also result in a sever loss of privacy due to the involvementof a third party. Additionally, users may not be able to make somespecific types of calls due to the presence of an interpreter. Forexample, federal laws prohibit anyone other than the user to call banks,credit card companies, and other agencies where highly private andcritical information exchange such as credit cards, pin codes isinvolved. Even if the use of an interpreter is not prohibited by law,there have been several reported cases of fraud performed by the thirdparty services. Finally, the dedicated hardware required to use atelecommunication relay service.

Aspects of the present disclosure address these and other problems byproviding an automated service that allows users with hearing or speechimpediments to make a phone call. In one aspect, an application residingon a user's device, such as a smartphone, tablet computer, laptop, etc.,may be used to initiate a phone call to a recipient. Upon initiating thephone call locally, a service residing on the server may receive arequest to initiate a connection to the recipient. Once the recipientanswers, the user may converse with the recipient by providing textinput to their local app. The text input may be transmitted to theservice. The service may use a text to speech converter to translate thereceived text to speech that can be delivered to the recipient. Spokenresponses from the recipient may also be received by the service andtranslated into text. The translated text may be provided to theapplication on the user's local device for display. The service may alsoprovide additional features such as gender recognition, voice selection,amplitude association for transcription visualization, and emotionrecognition for transcription visualization, among other services.

FIG. 1 illustrates an overview of an exemplary system 100 for providingan automated interpreter service. System 100 may include a user device102 that is operable to connect to a server 104 via a network 110. Userdevice 102 may be any type of computing device operable to receivetextual input and display responses. Exemplary devices include, but arenot limited to, smartphones, laptops, tablet computers, etc. Aninterpreter Application 106 may be used to initiate a call with arecipient. The interpreter application 106 may initiate a connectionwith an interpreter service 108 residing on server 104. The interpreterservice 108 may establish a connection with one or more recipientdevices, such as recipient devices 112A-C depicted in system 100. Uponinitiated a connection with a recipient device, such as, for example,recipient device 112A, the interpreter service 108 may transmitprerecorded audio to the recipient device. The prerecorded audio mayinform the recipient that an automated interpreter service 108 is beingused to facilitate a call with a hearing or speech impaired user.

Upon establishing the connection, interpreter application 106 mayreceive text input from a user via a user interface. The text inputreceived by interpreter application 106 may be transmitted to theinterpreter service 108 via the network 110. Upon receiving the text,interpreter service 108 may process the text in order to convert thetext to speech. The speech generated by the interpreter service may thenbe delivered to the recipient device 112A.

The interpreter service 108 may also be operable to receive spoken inputfrom a recipient device, such as recipient device 112A. The receivedspoken input may be process in order to convert the spoken input totext. The converted text may then be delivered to user device 112. Theinterpreter application 106 may receive the converted text and displaythe converted text to the user. In doing so, the interpreter service 108may facilitate a phone conversation between a hearing or speech impaireduser and a recipient.

Additional features may also be provided by the system 100. Oneadditional service may be gender recognition. The interpreter service108 may process a recipient's voice in order to determine if therecipient is a male or female. Upon determining the recipient's gender,the gender information may be transmitted to the interpreter application106. The interpreter application 106 may use the gender information todisplay an indication to the user of the recipient's gender.

Another additional service may be voice selection for the user. Uponinitiating the call, or at an earlier time, the user may provideinformation about the user to the interpreter service 108. The providedinformation may be used by the interpreter service to select a gender,age, and nationality for the user. The selected gender, age, andnationality may be used to select an appropriate voice for a text tospeech process. In doing so, the interpreter service 108 may select avoice that represents the user's gender, age, and nationality, therebyconveying information about the user to the recipient. In certainaspects, a user may be able to change their gender, age, and/ornationality. In response, the interpreter service 108 may select adifferent speech to text process for the user.

The interpreter service 108 may also perform amplitude association. Inone example, upon initiating a connection with the recipient, theinterpreter service 108 may initially analyze the amplitude of thespeech received from the recipient in order to determine a baselineamplitude. Upon determining the baseline amplitude, the interpreterservice 108 may compare newly received speech from the recipient againstthe baseline amplitude to determine if the recipient is speaking louderor softer. In one example, changes in amplitude may be conveyed to theinterpreter application 106. The interpreter application 106 may usechanges in the amplitude to alter the display of the text presented tothe user. For example, if the recipient is speaking louder, such asshouting, the font size of the displayed text may be increased. If therecipient is speaking softer, such as whispering, the font size of thedisplayed text may be decreased. While examples provided herein describethe interpreter application 106 as determining a font size for thedisplayed text based upon amplitude information, in alternate examplesthe determination of text size may be performed by the interpreterservice 108.

Emotional recognition may also be provided by the system 100. Uponreceiving speech from the recipient, the interpreter service 108 maydetermine the recipient's current emotional state (e.g., angry, sad,happy, excited, neutral). The determined emotional state may be used toalter the display of the displayed text. For example, the sentence ofthe transcribed text displayed to the user may be displayed in a certaincolor depending on the recipient's emotional state.

While the aspects described thus far relate to a conversation between auser and a single recipient, in further aspects the interpreter servicemay be extended to a multi-caller conference call. In such examples,upon receiving speech from a recipient device, the interpreter service108 may transcribe the received speech and identify the recipient whoprovided the speech. The recipient's name may then be transmitted to theinterpreter application 106 with the transcribed sentence. Theinterpreter application 106 may then display the transcribed sentence inassociation with the recipient's name, thereby allowing the user to knowwho spoke the transcribed text. In one example, the recipient may beidentified based upon the device that transmitted the spoken input tothe interpretation service 108.

In yet another example, the interpreter service 108 may be operable todetermine when multiple recipients are speaking into a single device.For example, voice analysis may be performed to identify differentspeakers. When multiple speakers are identified, the recipient service108 may transmit an indication of which speaker should be associatedwith the transcribed text. The interpreter application 106 may displayan indication of which speaker spoke the transcribed text whendisplaying the text.

FIG. 2 illustrates an exemplary method 200 of providing an automatedinterpreter service. In one example, the method 200 may be performed bya device, such as server device 104. Flow begins at operation 200 wherea call is received from a device. In one example, the received call maybe initiated by an interpreter application residing on a user's device.Upon receiving the call, flow continues to operation 202. At operation202, a call may be initiated with a remote device. The remote device maybe identified by information received at operation 202.

Flow continues to operation 206 where an initial message is transmittedto the remote device. The initial message may be a preset audio. Thepreset audio may be used to inform the recipient that they are speakingwith the user via an automated interpreter service. Flow may thencontinue to optional operation 208. In some aspects, the recipient mayhave to consent to speak with the user via the automated interpreterservice. In such aspects, consent may be received at optional operation208.

Flow continues to operation 210 where data is received. The receiveddata may originate from the recipient device or the users device. Thereceived data may be spoken audio or text. Flow continues to decisionoperation 212 where a determination is made as to whether the receiveddata is from an interpreter application. If the received data is fromthe interpreter application, flow branches yes to operation 214. Atoperation 214, text data received from the interpreter application isconverted to speech. The converted speech may then be sent to therecipient device. Upon sending the converted text to the recipientdevice, flow continues to decision operation 218.

Returning to decision operation 212, if the data is not received fromthe interpreter application, that is, the data is spoken data receivedby the recipient, flow branches No to operation 216. At operation 216,the speech received from the recipient device is converted to text andsent to the interpreter application. Flow then continues to decisionoperation 218 where a determination is made as to whether the call isended. If the call has finished, flow branches Yes and the method 200terminates. If, however, the call has not ended, flow branches No andreturns to operation 210. The method 200 then continues in a loop untilthe call has finished.

FIGS. 3-6 and the associated descriptions provide a discussion of avariety of operating environments in which aspects of the disclosure maybe practiced. However, the devices and systems illustrated and discussedwith respect to FIGS. 3-6 are for purposes of example and illustrationand are not limiting of a vast number of computing device configurationsthat may be utilized for practicing aspects of the disclosure, describedherein.

FIG. 3 is a block diagram illustrating physical components (e.g.,hardware) of a computing device 300 with which aspects of the disclosuremay be practiced. The computing device components described below may besuitable for the computing devices described above, including the clientcomputing devices 102A-C and the server computing devices 106A-C. In abasic configuration, the computing device 300 may include at least oneprocessing unit 302 and a system memory 304. Depending on theconfiguration and type of computing device, the system memory 304 maycomprise, but is not limited to, volatile storage (e.g., random accessmemory), non-volatile storage (e.g., read-only memory), flash memory, orany combination of such memories. The system memory 304 may include anoperating system 305 and one or more program modules 306 suitable forperforming the various aspects disclosed herein such interpreterapplication 320 and interpreter service 322 324 and a queryreformulation component 326. The operating system 305, for example, maybe suitable for controlling the operation of the computing device 300.Furthermore, embodiments of the disclosure may be practiced inconjunction with a graphics library, other operating systems, or anyother application program and is not limited to any particularapplication or system. This basic configuration is illustrated in FIG. 3by those components within a dashed line 308. The computing device 300may have additional features or functionality. For example, thecomputing device 300 may also include additional data storage devices(removable and/or non-removable) such as, for example, magnetic disks,optical disks, or tape. Such additional storage is illustrated in FIG. 3by a removable storage device 309 and a non-removable storage device310.

As stated above, a number of program modules and data files may bestored in the system memory 304. While executing on the processing unit302, the program modules 306 (e.g., interpreter application 320 and/orinterpreter service 322) may perform processes including, but notlimited to, the aspects, as described herein. Other program modules thatmay be used in accordance with aspects of the present disclosure mayinclude electronic mail and contacts applications, word processingapplications, spreadsheet applications, database applications, slidepresentation applications, drawing or computer-aided applicationprograms, etc.

Furthermore, embodiments of the disclosure may be practiced in anelectrical circuit comprising discrete electronic elements, packaged orintegrated electronic chips containing logic gates, a circuit utilizinga microprocessor, or on a single chip containing electronic elements ormicroprocessors. For example, embodiments of the disclosure may bepracticed via a system-on-a-chip (SOC) where each or many of thecomponents illustrated in FIG. 3 may be integrated onto a singleintegrated circuit. Such an SOC device may include one or moreprocessing units, graphics units, communications units, systemvirtualization units and various application functionality all of whichare integrated (or “burned”) onto the chip substrate as a singleintegrated circuit. When operating via an SOC, the functionality,described herein, with respect to the capability of client to switchprotocols may be operated via application-specific logic integrated withother components of the computing device 300 on the single integratedcircuit (chip). Embodiments of the disclosure may also be practicedusing other technologies capable of performing logical operations suchas, for example, AND, OR, and NOT, including but not limited tomechanical, optical, fluidic, and quantum technologies. In addition,embodiments of the disclosure may be practiced within a general purposecomputer or in any other circuits or systems.

The computing device 300 may also have one or more input device(s) 312such as a keyboard, a mouse, a pen, a sound or voice input device, atouch or swipe input device, etc. The output device(s) 314 such as adisplay, speakers, a printer, etc. may also be included. Theaforementioned devices are examples and others may be used. Thecomputing device 300 may include one or more communication connections316 allowing communications with other computing devices 350. Examplesof suitable communication connections 316 include, but are not limitedto, radio frequency (RF) transmitter, receiver, and/or transceivercircuitry; universal serial bus (USB), parallel, and/or serial ports.

The term computer readable media as used herein may include computerstorage media. Computer storage media may include volatile andnonvolatile, removable and non-removable media implemented in any methodor technology for storage of information, such as computer readableinstructions, data structures, or program modules. The system memory304, the removable storage device 309, and the non-removable storagedevice 310 are all computer storage media examples (e.g., memorystorage). Computer storage media may include RAM, ROM, electricallyerasable read-only memory (EEPROM), flash memory or other memorytechnology, CD-ROM, digital versatile disks (DVD) or other opticalstorage, magnetic cassettes, magnetic tape, magnetic disk storage orother magnetic storage devices, or any other article of manufacturewhich can be used to store information and which can be accessed by thecomputing device 300. Any such computer storage media may be part of thecomputing device 300. Computer storage media does not include a carrierwave or other propagated or modulated data signal.

Communication media may be embodied by computer readable instructions,data structures, program modules, or other data in a modulated datasignal, such as a carrier wave or other transport mechanism, andincludes any information delivery media. The term “modulated datasignal” may describe a signal that has one or more characteristics setor changed in such a manner as to encode information in the signal. Byway of example, and not limitation, communication media may includewired media such as a wired network or direct-wired connection, andwireless media such as acoustic, radio frequency (RF), infrared, andother wireless media.

FIGS. 4A and 4B illustrate a mobile computing device 400, for example, amobile telephone, a smart phone, wearable computer (such as a smartwatch), a tablet computer, a laptop computer, and the like, with whichembodiments of the disclosure may be practiced. In some aspects, theclient may be a mobile computing device. With reference to FIG. 4A, oneaspect of a mobile computing device 400 for implementing the aspects isillustrated. In a basic configuration, the mobile computing device 400is a handheld computer having both input elements and output elements.The mobile computing device 400 typically includes a display 405 and oneor more input buttons 410 that allow the user to enter information intothe mobile computing device 400. The display 405 of the mobile computingdevice 400 may also function as an input device (e.g., a touch screendisplay). If included, an optional side input element 415 allows furtheruser input. The side input element 415 may be a rotary switch, a button,or any other type of manual input element. In alternative aspects,mobile computing device 400 may incorporate more or less input elements.For example, the display 405 may not be a touch screen in someembodiments. In yet another alternative embodiment, the mobile computingdevice 400 is a portable phone system, such as a cellular phone. Themobile computing device 400 may also include an optional keypad 435.Optional keypad 435 may be a physical keypad or a “soft” keypadgenerated on the touch screen display. In various embodiments, theoutput elements include the display 405 for showing a graphical userinterface (GUI), a visual indicator 420 (e.g., a light emitting diode),and/or an audio transducer 425 (e.g., a speaker). In some aspects, themobile computing device 400 incorporates a vibration transducer forproviding the user with tactile feedback. In yet another aspect, themobile computing device 400 incorporates input and/or output ports, suchas an audio input (e.g., a microphone jack), an audio output (e.g., aheadphone jack), and a video output (e.g., a HDMI port) for sendingsignals to or receiving signals from an external device.

FIG. 4B is a block diagram illustrating the architecture of one aspectof a mobile computing device. That is, the mobile computing device 400can incorporate a system (e.g., an architecture) 402 to implement someaspects. In one embodiment, the system 402 is implemented as a “smartphone” capable of running one or more applications (e.g., browser,e-mail, calendaring, contact managers, messaging clients, games, andmedia clients/players). In some aspects, the system 402 is integrated asa computing device, such as an integrated personal digital assistant(PDA) and wireless phone.

One or more application programs 466 may be loaded into the memory 462and run on or in association with the operating system 464. Examples ofthe application programs include phone dialer programs, e-mail programs,personal information management (PIM) programs, word processingprograms, spreadsheet programs, Internet browser programs, messagingprograms, and so forth. The system 402 also includes a non-volatilestorage area 468 within the memory 462. The non-volatile storage area468 may be used to store persistent information that should not be lostif the system 402 is powered down. The application programs 466 may useand store information in the non-volatile storage area 468, such ase-mail or other messages used by an e-mail application, and the like. Asynchronization application (not shown) also resides on the system 402and is programmed to interact with a corresponding synchronizationapplication resident on a host computer to keep the information storedin the non-volatile storage area 468 synchronized with correspondinginformation stored at the host computer. As should be appreciated, otherapplications may be loaded into the memory 462 and run on the mobilecomputing device 400 described herein (e.g., search engine, extractormodule, relevancy ranking module, answer scoring module, etc.).

The system 402 has a power supply 470, which may be implemented as oneor more batteries. The power supply 470 might further include anexternal power source, such as an AC adapter or a powered docking cradlethat supplements or recharges the batteries.

The system 402 may also include a radio interface layer 472 thatperforms the function of transmitting and receiving radio frequencycommunications. The radio interface layer 472 facilitates wirelessconnectivity between the system 402 and the “outside world,” via acommunications carrier or service provider. Transmissions to and fromthe radio interface layer 472 are conducted under control of theoperating system 464. In other words, communications received by theradio interface layer 472 may be disseminated to the applicationprograms 466 via the operating system 464, and vice versa.

The visual indicator 420 may be used to provide visual notifications,and/or an audio interface 474 may be used for producing audiblenotifications via the audio transducer 425. In the illustratedembodiment, the visual indicator 420 is a light emitting diode (LED) andthe audio transducer 425 is a speaker. These devices may be directlycoupled to the power supply 470 so that when activated, they remain onfor a duration dictated by the notification mechanism even though theprocessor 460 and other components might shut down for conservingbattery power. The LED may be programmed to remain on indefinitely untilthe user takes action to indicate the powered-on status of the device.The audio interface 474 is used to provide audible signals to andreceive audible signals from the user. For example, in addition to beingcoupled to the audio transducer 425, the audio interface 474 may also becoupled to a microphone to receive audible input, such as to facilitatea telephone conversation. In accordance with embodiments of the presentdisclosure, the microphone may also serve as an audio sensor tofacilitate control of notifications, as will be described below. Thesystem 402 may further include a video interface 476 that enables anoperation of an on-board camera 430 to record still images, videostream, and the like.

A mobile computing device 400 implementing the system 402 may haveadditional features or functionality. For example, the mobile computingdevice 400 may also include additional data storage devices (removableand/or non-removable) such as, magnetic disks, optical disks, or tape.Such additional storage is illustrated in FIG. 4B by the non-volatilestorage area 468.

Data/information generated or captured by the mobile computing device400 and stored via the system 402 may be stored locally on the mobilecomputing device 400, as described above, or the data may be stored onany number of storage media that may be accessed by the device via theradio interface layer 472 or via a wired connection between the mobilecomputing device 400 and a separate computing device associated with themobile computing device 400, for example, a server computer in adistributed computing network, such as the Internet. As should beappreciated such data/information may be accessed via the mobilecomputing device 400 via the radio interface layer 472 or via adistributed computing network. Similarly, such data/information may bereadily transferred between computing devices for storage and useaccording to well-known data/information transfer and storage means,including electronic mail and collaborative data/information sharingsystems.

FIG. 5 illustrates one aspect of the architecture of a system forprocessing data received at a computing system from a remote source,such as a personal computer 504, tablet computing device 506, or mobilecomputing device 508, as described above. Content displayed at serverdevice 502 may be stored in different communication channels or otherstorage types. For example, various documents may be stored using adirectory service 522, a web portal 524, a mailbox service 526, aninstant messaging store 528, or a social networking site 530.Interpreter application 521 may be employed by a client thatcommunicates with server device 502, and/or interpreter service 520 maybe employed by server device 502. The server device 502 may provide datato and from a client computing device such as a personal computer 504, atablet computing device 506 and/or a mobile computing device 508 (e.g.,a smart phone) through a network 515. By way of example, the computersystem described above may be embodied in a personal computer 504, atablet computing device 506 and/or a mobile computing device 508 (e.g.,a smart phone). Any of these embodiments of the computing devices mayobtain content from the store 516, in addition to receiving graphicaldata useable to be either pre-processed at a graphic-originating system,or post-processed at a receiving computing system.

FIG. 6 illustrates an exemplary tablet computing device 600 that mayexecute one or more aspects disclosed herein. In addition, the aspectsand functionalities described herein may operate over distributedsystems (e.g., cloud-based computing systems), where applicationfunctionality, memory, data storage and retrieval and various processingfunctions may be operated remotely from each other over a distributedcomputing network, such as the Internet or an intranet. User interfacesand information of various types may be displayed via on-board computingdevice displays or via remote display units associated with one or morecomputing devices. For example user interfaces and information ofvarious types may be displayed and interacted with on a wall surfaceonto which user interfaces and information of various types areprojected. Interaction with the multitude of computing systems withwhich embodiments of the invention may be practiced include, keystrokeentry, touch screen entry, voice or other audio entry, gesture entrywhere an associated computing device is equipped with detection (e.g.,camera) functionality for capturing and interpreting user gestures forcontrolling the functionality of the computing device, and the like.

Aspects of the present disclosure, for example, are described above withreference to block diagrams and/or operational illustrations of methods,systems, and computer program products according to aspects of thedisclosure. The functions/acts noted in the blocks may occur out of theorder as shown in any flowchart. For example, two blocks shown insuccession may in fact be executed substantially concurrently or theblocks may sometimes be executed in the reverse order, depending uponthe functionality/acts involved.

The description and illustration of one or more aspects provided in thisapplication are not intended to limit or restrict the scope of thedisclosure as claimed in any way. The aspects, examples, and detailsprovided in this application are considered sufficient to conveypossession and enable others to make and use the best mode of claimeddisclosure. The claimed disclosure should not be construed as beinglimited to any aspect, example, or detail provided in this application.Regardless of whether shown and described in combination or separately,the various features (both structural and methodological) are intendedto be selectively included or omitted to produce an embodiment with aparticular set of features. Having been provided with the descriptionand illustration of the present application, one skilled in the art mayenvision variations, modifications, and alternate aspects falling withinthe spirit of the broader aspects of the general inventive conceptembodied in this application that do not depart from the broader scopeof the claimed disclosure.

1.-20. (canceled)
 21. A method of a real-time communication initiated bya voice caller to a non-voice callee, the method comprising: receiving,by an interpreter application used by the non-voice callee, voice callinitiation data from the voice caller, wherein the interpreterapplication communicates the voice call initiation data to the non-voicecallee based on text data; establishing a voice phone call with thevoice caller based on the voice call initiation data; receiving, by theinterpreter application, speech data from the voice caller; converting,by the interpreter application, the speech data to text data; andproviding the text data to the non-voice callee.
 22. The method of claim21, wherein the interpreter application is associated with a service.23. The method of claim 21, further comprising: analyzing the speechdata to determine a gender of the voice caller; and providing the textdata to the non-voice callee with an indication of the gender of thevoice caller.
 24. The method of claim 23, wherein the indication of thegender is a font of the text data.
 25. The method of claim 21, furthercomprising: analyzing the speech data to determine an identity of thevoice caller; and providing the text data to the non-voice callee withan indication of the identity of the voice caller.
 26. The method ofclaim 21, further comprising: analyzing the speech data to determine anemotional state of the voice caller; and modifying the text data toindicate the emotional state of the voice caller.
 27. The method ofclaim 21, further comprising: analyzing the speech data to determine anamplitude; and modifying the text data to indicate the amplitude of thespeech data.
 28. The method of claim 21, further comprising: analyzingthe speech data to determine one or more of: a gender of the voicecaller, an identity of the voice caller, an emotional state of the voicecaller, or an amplitude; and providing the text data to the non-voicecallee with one or more indications based on the determining.
 29. Asystem, comprising: at least one processor; and at least one memorystoring computer-executable instructions that when executed cause the atleast one processor to: receive, by an interpreter application used by anon-voice callee, voice call initiation data, wherein the voice callinitiation data is operable to establish a voice phone call between thenon-voice callee and two or more voice callers; receive, by theinterpreter application, speech data from the two or more voice callers;associate a first portion of the speech data with a first voice callerof the two or more voice callers and a second portion of the speech datawith a second voice caller of the two or more voice callers; convert, bythe interpreter application, the first portion of the speech data tofirst text data associated with the first voice caller of the two ormore voice callers and the second portion of the speech data to secondtext data associated with the second voice caller of the two or morevoice callers; and provide the first text data and the second text datato the non-voice callee.
 30. The system of claim 29, thecomputer-executable instructions when executed further causing the atleast one processor to: analyze the first portion of the speech data todetermine a first identity of the first voice caller and the secondportion of the speech data to determine a second identity of the secondvoice caller; and provide the first text data with a first voice calleridentifier of the first voice caller and the second text data with asecond voice caller identifier of the second voice caller.
 31. Thesystem of claim 29, wherein the interpreter application is associatedwith a service.
 32. The system of claim 29, wherein the voice phone callis a conference call.
 33. The system of claim 29, thecomputer-executable instructions when executed further causing the atleast one processor to: analyze the first portion of the speech data todetermine a first gender of the first voice caller and the secondportion of the speech data to determine a second gender of the secondvoice caller; and provide the first text data with a first indication ofthe first gender of the first voice caller and the second text data witha second indication of the second gender of the second voice caller. 34.The system of claim 33, wherein the first gender is different than thesecond gender, and wherein the first indication of the first gender isdifferent than the second indication of the second gender.
 35. Thesystem of claim 34, wherein the first indication of the first gender isa first font of the first text data and the second indication of thesecond gender is a second font of the second text data.
 36. The systemof claim 33, wherein the first gender is the same as the second gender,and wherein the first indication of the first gender is the same as thesecond indication of the second gender.
 37. The system of claim 29, thecomputer-executable instructions when executed further causing the atleast one processor to: analyze the first portion of the speech data todetermine a first emotional state of the first voice caller and thesecond portion of the speech data to determine a second emotional stateof the second voice caller; and modify the first text data to indicatethe first emotional state of the first voice caller and the second textdata to indicate the second emotional state of the second voice caller.38. The system of claim 37, wherein the first emotional state isdifferent than the second emotional state, and wherein the indication ofthe first emotional state is different than the indication of the secondemotional state.
 39. A computer storage medium storingcomputer-executable instructions that when executed cause at least oneprocessor to: receive, by an interpreter application used by a non-voicecallee, voice call initiation data, wherein the voice call initiationdata is operable to establish a voice phone call between the non-voicecallee and a voice caller; receive, by the interpreter application,speech data from the voice caller; analyze the speech data to determinean identity of the voice caller; convert, by the interpreterapplication, the speech data to text data; and provide the text data tothe non-voice callee with an identifier of the voice caller.
 40. Thecomputer storage medium of claim 39, the computer-executableinstructions when executed further causing the at least one processorto: analyze the speech data to determine a gender of the voice caller;and provide the text data to the non-voice callee in a font associatedwith the gender of the voice caller.